Re-annotating the Mycoplasma pneumoniae genome sequence: adding value, function and reading frames.

نویسندگان

  • T Dandekar
  • M Huynen
  • J T Regula
  • B Ueberle
  • C U Zimmermann
  • M A Andrade
  • T Doerks
  • L Sánchez-Pulido
  • B Snel
  • M Suyama
  • Y P Yuan
  • R Herrmann
  • P Bork
چکیده

Four years after the original sequence submission, we have re-annotated the genome of Mycoplasma pneumoniae to incorporate novel data. The total number of ORFss has been increased from 677 to 688 (10 new proteins were predicted in intergenic regions, two further were newly identified by mass spectrometry and one protein ORF was dismissed) and the number of RNAs from 39 to 42 genes. For 19 of the now 35 tRNAs and for six other functional RNAs the exact genome positions were re-annotated and two new tRNA(Leu) and a small 200 nt RNA were identified. Sixteen protein reading frames were extended and eight shortened. For each ORF a consistent annotation vocabulary has been introduced. Annotation reasoning, annotation categories and comparisons to other published data on M.pneumoniae functional assignments are given. Experimental evidence includes 2-dimensional gel electrophoresis in combination with mass spectrometry as well as gene expression data from this study. Compared to the original annotation, we increased the number of proteins with predicted functional features from 349 to 458. The increase includes 36 new predictions and 73 protein assignments confirmed by the published literature. Furthermore, there are 23 reductions and 30 additions with respect to the previous annotation. mRNA expression data support transcription of 184 of the functionally unassigned reading frames.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Agent-Based System for Re-annotation of Genomes

Genome annotation projects can produce incorrect results if they are based on obsolete data or inappropriate models. We have developed an automatic re-annotation system that uses agents to perform repetitive tasks and reports the results to the user. These tasks involve BLAST searches on biological databases (GenBank) and the use of detection tools (Genemark and Glimmer) to identify new open re...

متن کامل

Comparative analysis of the genomes of the bacteria Mycoplasma pneumoniae and Mycoplasma genitalium.

The sequenced genomes of the two closely related bacteria Mycoplasma genitalium and Mycoplasma pneumoniae were compared with emphasis on genome organization and coding capacity. All the 470 proposed open reading frames (ORFs) of the smaller M.genitalium genome (580 kb) were contained in the larger genome (816 kb) of M.pneumoniae. There were some discrepancies in annotation, but inspection of th...

متن کامل

Proteogenomic mapping as a complementary method to perform genome annotation.

The accelerated rate of genomic sequencing has led to an abundance of completely sequenced genomes. Annotation of the open reading frames (ORFs) (i.e., gene prediction) in these genomes is an important task and is most often performed computationally based on features in the nucleic acid sequence. Using recent advances in proteomics, we set out to predict the set of ORFs for an organism based p...

متن کامل

Complete sequence analysis of the genome of the bacterium Mycoplasma pneumoniae.

The entire genome of the bacterium Mycoplasma pneumoniae M129 has been sequenced. It has a size of 816,394 base pairs with an average G+C content of 40.0 mol%. We predict 677 open reading frames (ORFs) and 39 genes coding for various RNA species. Of the predicted ORFs, 75.9% showed significant similarity to genes/proteins of other organisms while only 9.9% did not reveal any significant similar...

متن کامل

Proteins P24 and P41 function in the regulation of terminal-organelle development and gliding motility in Mycoplasma pneumoniae.

Mycoplasma pneumoniae is a major cause of bronchitis and atypical pneumonia in humans. This cell wall-less bacterium has a complex terminal organelle that functions in cytadherence and gliding motility. The gliding mechanism is unknown but is coordinated with terminal-organelle development during cell division. Disruption of M. pneumoniae open reading frame MPN311 results in loss of protein P41...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Nucleic acids research

دوره 28 17  شماره 

صفحات  -

تاریخ انتشار 2000